How to Safely Exploit Predictions in General-Sum Normal Form Games
ثبت نشده
چکیده
Given a prediction of opponent behavior in a generalsum two-player normal form game, it is difficult to select a strategy that balances the opportunity to use the prediction to inform one’s action with the risk of becoming predictable. We propose Restricted Stackelberg Response with Safety (RSRS), a novel way of generating such a strategy. RSRS uses an r-safe Stackelberg equilibrium in a modified game, which is created to reflect the assumption that the prediction might be inaccurate. With appropriate parameter selection, RSRS produces strategies that can play well against the prediction, respond well against a best-responding opponent, or guard against worst-case outcomes. We describe an algorithm that selects appropriate parameter values, which we have tested on multiple general-sum games, comparing its performance to that of other algorithms.
منابع مشابه
How to safely exploit predictions in general-sum normal form games
Given a general-sum normal form game and a prediction of opponent behavior it is difficult to select a strategy which balances the opportunity to exploit the prediction with the risk of being exploited. We propose Restricted Stackelberg Response with Safety (RSRS), a novel way of generating such a strategy. RSRS uses an r-safe Stackelberg equilibrium in a modified game, which is created to refl...
متن کاملHow to Safely Exploit Predictions in General-Sum Normal Form Games
Given a prediction of opponent behavior in a generalsum two-player normal form game, it is difficult to select a strategy that balances the opportunity to use the prediction to inform one’s action with the risk of becoming predictable. We propose Restricted Stackelberg Response with Safety (RSRS), a novel way of generating such a strategy. RSRS uses an r-safe Stackelberg equilibrium in a modifi...
متن کاملSafely Using Predictions in General-Sum Normal Form Games
It is often useful to predict opponent behavior when playing a generalsum two-player normal form game. However best-responding to an inaccurate prediction can lead to a strategy which is vulnerable to exploitation. This paper proposes a novel method, Restricted Stackelberg Response with Safety (RSRS), for an agent to select a strategy to respond to a prediction. The agent uses the confidence it...
متن کاملAn Approach to Cooperation in General-Sum Normal Form Games
My primary research interest is social behavior for software agents to achieve cooperation in general-sum normal form games. An agent can easily be programmed to constantly cooperate in a normal form game, but such an agent is not suitable for environments with potentially hostile opponents. For a rational agent, the main reason to cooperate is to induce reciprocation therefore it is necessary ...
متن کاملEquilibrium play and best response to (stated) beliefs in normal form games
We report experimental results on a series of ten one-shot two-person 3 × 3 normal form games with unique equilibrium in pure strategies played by non-economists. In contrast to previous experiments in which game theory predictions fail dramatically, a majority of actions taken coincided with the equilibrium prediction (70.2%) and were best-responses to subjects’ stated beliefs (67.2%). In cons...
متن کامل